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This paper provides rate-efficient estimators of the volatility parameter 



1. Introduction. In this paper, we continue the study started in about 



t^h ' the estimation of parameters when one observes a Levy process X at n regularly 

spaced times A n , 2A n , . . . , nA n , with A n going to as n —* oo. In our earlier paper, 
we were concerned with the asymptotic behavior of the Fisher information, with 
the objective of establishing a benchmark for what efficient estimators are able to 
achieve in that context. Now, we wish to exhibit estimators which both achieve that 
rate and can be explicitly computed. 



(1) X t = aW t + Y t , 



in 
o 

, We want to estimate a positive parameter a, which we call volatility, in the model 

B 

> 

^ ■ where W is a standard Wiener process or, more generally, a symmetric stable process 

h : 

of index j3, and the process Y is another Levy process without Wiener part and with 
jumps "dominated" in a sense we make precise below by those of W. Allowing for 
jumps is of great interest in mathematical finance, in the diverse contexts of option 
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pricing, testing for the presence of jumps in asset prices, interest rate modelling, 
risk management, optimal portfolio choice, stochastic volatility modelling or for the 
purpose of better describing asset returns data (see the references cited in |2]). 

Our aim is to construct estimators for a which behave under the model Q "as 
well as" under the model 

(2) X t = aW t , 

asymptotically as A n — > and n — > oo. This is in line with the results of |2j, 
in which we proved that property for the Fisher information. In other words, we 
want to be able to estimate the volatility parameter a at the same rate when Y, 
a jump perturbation of W, is present as when it is not. In some applications, Y 
may represent frictions that are due to the mechanics of the trading process, or 
in the case of compound Poisson jumps it may represent the infrequent arrival of 
relevant information related to the asset. Given that both W and Y contribute to 
the overall observed noise in X, it is not a priori obvious that it should be possible 
to estimate a equally well (at least in the rate sense) with and without Y. Beyond 
the robustness to misspecification risk that such a result affords, it also for instance 
paves the way for risk management or option hedging that is able to target the a W 
risk" (continuous when (3 = 2) separately from the "Y risk" (discontinuous). 

We distinguish between a parametric case, where the law of Y is known, and a 
semiparametric case, where it is not. We show that, in the parametric case, one can 
find estimators which are asymptotically efficient in the Cramer-Rao sense, meaning 
that the asymptotic estimation variance is equivalent as n — > oo to the inverse of the 
Fisher information for the model © without the perturbation Y. This is possible 
when the law of Y is completely known. In the semiparametric case, where that 
law is unknown, obtaining asymptotically efficient estimators requires A n to go fast 
enough to 0; but we can then exhibit estimators that are efficient uniformly when 
the law of Y stays in a set sufficiently separated from the law of W. And in general 
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we can exhibit a large class of estimators which are consistent and achieve a specified 
rate (although not the efficient rate). 

A distinctive feature of the present paper is that we construct estimators which are 
as simple as possible to implement. For example, in the parametric situation where 
the law of Y is known, one can in principle compute the MLE, which is of course 
efficient. In practice, this is hardly feasible, as the likelihood function derived from 
the convolution of the densities of W and Y will in most situations not be available 
in closed form. So we provide a number of other - much simpler - estimators which 
are not as good (in the sense of not reaching the Cramer-Rao lower bound in general) 
but not too bad either (in the sense of achieving the efficient rate of convergence) . 

The paper is organized as follows. In Section |2 we specify our estimating setting. 
Section El is devoted to estimating equations: the estimators we propose all fall in 
that class and we state a general result which covers them all. Sections 0] and [5] arc 
devoted to the parametric and semiparametric cases respectively. Some examples 
are developed in Section El El and |H1 where we consider specific types of estimating 
equations such as the empirical characteristic function , power variations and power 
variations with truncation. 

2. The setting. With Xq = 0, we observe n i.i.d. increments from the Levy 
process Q, 



so that, when (3 = 2, W is a standard Wiener process. The parameter to be estimated 
is a, and we will single out two situations concerning the parameter space O: either 
= (0, do), or G is a compact subset of (0, oo). 



(3) 



Xi - XiA n ~ ^(i-l)A„- 



W is a symmetric stable process of index (3 G (0, 2], characterized by 



(4) 




4 YACINE AIT-SAHALIA AND JEAN JACOD 

The law of Y (as a process) is entirely specified by the law Ga of the variable 
Ya for any given A > 0. We write G = Gi, and we recall that the characteristic 
function of Ga is given by the Levy-Khintchine formula 

(5) E{e lvY *) = exp A ^ivb - ^ + J F{dx) (e ivx - 1 - wxl {N < 1} )^ 

where (b,c,F) is the "characteristic triple" of G (or, of Y): b G R is the drift of Y, 
and c > the local variance of the continuous part of Y, and F is the Levy jump 
measure of Y, which satisfies f (l A x 2 ) F{dx) < oo. We will denote by P a ,G the law 
of the process X. 

We make Y "dominated" by W in the following sense: G belongs to the class Qr, 
defined as follows. Let first $ be the class of all increasing and bounded functions 
4> : (0, 1] — > R + having lim^o 4>( x ) = 0- Then we set 

(6) 

Q(4>, a) = the set of all infinitely divisible distributions with c = and, for all x G (0, 1], 

f x a F([-x, xf) < <j){x) if a < 2 
then < 

(x 2 F([-x,x] c ) <4>{x) and S M < x} \y\ 2 F{dy) < <f>{x) if a = 2, 

(7) Q'((f),a) = {G<EG(4>,a), G is symmetrical about 0}, 



(8) Ga = U</, 6 $ G(<f>,a), G' a = U0 6 $ g'((/>,a), 

and we have 
(9) 

a G (0, 2] <? a = {G is infinitely divisible, c = 0, lim^o x a F([-x, x] c ) = 0} 

a = 2 =>■ ^2 = {G is infinitely divisible, c = 0} . 
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Now we recall some results from j2]. The variable W\ admits a C°° density hp, 
which is differentiable in the state variable (the derivative is denoted by h'^). Then 
we set 

( 10 ) 

' ZiP) = J h l3^ lr) " ! "'' 



hp{w) = hp(w)+wh'p(w), hp(w) = f_ fn:A , hp(w) 



wh'p{w) 



so in fact is the Fisher information when we estimate a on the basis of the 

single observation aW\ and for the parameter value a = 1. The functions hp and 
hp and hp are also C°°, and satisfy for some constant cp: 
(11) 

(3<2 hp{w) + \hp(w)\ + \hp{x)\ < 1+| g 1+<a , \hp{w)\<cp, 



hp(w) = (l-w 2 )hp(w), hp(x) = (l-w 2 Y hp(w), hp(w) 



-w 



and of course h2(w) = e w,2/ ' 2 /v / 2vr, so in particular T{(5) = 2. 

If we have a single observation Xa there is a (finite) Fisher information for es- 
timating a, which we denote by I&(a,G). With n observed increments the corre- 
sponding Fisher information becomes 



(12) 



The main result of j^j, as far as the parameter a is concerned, is summarized in the 
following: 

Theorem 1. a) If G G Qp we have as A — » 0: 



(13) 



0~ 



b) For any <j> E $ we have as A —* 0: 



(14) 



sup 

GeG(<j>,p) 



Ia(ct,G) 



a- 



c) For each n let G n be the standard symmetric stable law of index a n , with 
a n a sequence strictly increasing to (3. Then for any sequence A n — > such that 
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{f5 — a n ) log A n — > (i.e. the rate at which A n — > is slow enough), the sequence of 
numbers lA n {o~,G n ) converges to a limit which is strictly less thanX{j3) / a 2 . 

Part (a) of the above theorem and (|12[) hint towards the existence of estimators 
a n such that y/n (cr n — a) converges to a centered Gaussian variable with variance 
a 2 /T{j3) under P^Gj when G G Qg is known: this is the parametric situation, and 
we will propose such estimators in Section 0] below. In the semiparametric situation 
where G is unknown, (c) suggests that we cannot achieve the same rate, unless, 
as given in (b), we know that G is in the class Q(4>,a) for some a < f3 and some 
function (f) G 

As a matter of fact, we can do slightly better. If 4>{x) = Q > for all x, we can 
still define G((fi, a) by ©, although </> no longer belongs to We denote such a class 
by GiCoi), that is we introduce the notation (we do not need to distinguish a < 2 
and a = 2 here): 
(15) 

G(Ci a ) = the set of all infinitely divisible distributions with c = and, for all x £ 
then x a F([-x,x] c ) < (, 

(16) Q ((,a) = {GeQ(C,o), G is symmetrical about 0}, 

(17) Q a = U C>0 G((, a), G' a = U C>0 V\C, a). 
The connection with the previous classes is as follows: 

(18) Q((p, a) C G((f>(l),a), Q a C G a C C\ a > g 2 = <? 2 . 

For example, Qq is the class of all G's for which Y is a pure drift (If = bt), whereas 
Go is the class of all G's for which Y is a compound Poisson process plus a drift. 
Also, any stable process Y with index a < 2 belongs to but not to Q a . 
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3. About estimating equations. The practical estimators we will propose for 
a are all obtained by setting an estimating equation (also known as a generalized 
moment condition) to zero. We prove here a general result about the asymptotic 
properties of such estimators, which will be used several times below. Similar general 
results for estimating equations are of course known (see various forms in |2j , [0] and 
) , but we adapt them here to our setting with assumptions (by no means minimal) 
that are sufficient in our context. 

Recall that we want to estimate a parameter a > 0. At stage n we observe p n 
i.i.d. random variables Xi an d introduce two auxiliary variables S n > and Q n £ R. 
Under the associated probability measure P njCr we suppose that the families (S n , Q n ) 
and (xf '■ 1 < i < Pn) are independent, and of course p n — > oo. Let us introduce the 
following conditions: 

Assumption 1 (Al). If a n — > a > i/ien S n — > (7 in P n o - n — probability . 
Assumption 2 (A2). If a n —* a > £/zen the sequence (Qn \ -E*n,<7 n ) ^ tight. 

Next we consider two families (f n ,s,q)s>0 and (-ff n ,s)s>o,qeR of functions on R 
and (0, oo) respectively, to be specified later but with adequate integrability and 
smoothness properties, and we associate the estimating function 



In this exactly-identified context, we set 

^ I the u > with U ns «(it) = which is closest to s if it exists 



(if C/ niSj g = has two closest solutions at equal distance of s, we select the smallest 
one). We also set 



(19) 





otherwise 



(21) 



F n>Stq (a) = B rha (fn,s,M)), Fi%{a) = E ni Mn,s, q (x?) 2 ). 
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Note in particular that we are not assuming that the estimating equation is correctly 
centered: correct centering would requiring using F n>S;q instead of H ns . H n s may 
be equal to F njS ^ q , but can also be just an approximation to it (in which case we 
will talk about "approximate centering") that may for instance be valid as n — ► oo. 
Incorrect centering leads to estimators that are asymptotically biased, although that 
effect can be mitigated as n — > oo if H n s approximates F njSjq (see Assumption (B5) 
below) . 

Let us now list a series of assumptions on the previous functions: 

Assumption 3 (Bl). We have sup n>li s>0i ggR ||/ n ,s, g || 4 /Pn < oo, where \\f\\ is the 
sup-norm. 

Assumption 4 (B2). H n:S is continuously differentiable. 

Assumption 5 (B3). For all s > there is a differentiable function F s on (0, oo), 
such that whenever s n — > s then H nttSn and H' n Sn converge locally uniformly to F s 
and f' s respectively. 

Assumption 6 (B4). f' s (s) / for all s > 0. 

Assumption 7 (B5). F^} nSn (u n ) converges to a limit F^ (u) for any two sequences 
u n and s n converging to the same limit u > and any bounded sequence q n . 

Assumption8 (B6). There is a sequence w n — > +cxd such that sup n w n \F n ^ nSn (u n ) — 
H n ,s n (u n ))\ < oo for any two sequences u n and s n converging to the same limit u > 
and any bounded sequence q n . 

Then we have the following: 
Theorem 2. Assume (Al), (A2) and (B1)-(B6). 

a) The sequence {(w n f\ y/fhi){o n {Sn-, Qn) — o~ n )) is tight under P niCr , uniformly in 
n and in a in any compact subset of (0, oo). 
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b) Ifw n /y/pH, — > oo, then the sequence {yjp^ (cr n (S n ,Q n ) — a n )) converges in law 
under ~P n ,a, uniformly in a in any compact subset of (0, oo), towards the centered 
normal distribution with variance H 2 (<r) := (F( 2 \a) — F r7 (a) 2 ) /F' a (a) 2 . 

We devote the remainder of this section to proving this theorem. First, we state 
a lemma which gathers some classical limit theorems on i.i.d. triangular arrays. For 
each n let {Q : i = 1, . . . ,q n ) be real-valued and i.i.d. random variables, possibly- 
defined on different probability spaces (J7 n ,jF n ,P n ) when n varies. Then: 

Lemma 1. Assume that Q l is square-integrable, and set 7 n = E ra (£") and T n = 
E n((CD 2 ) - In- J fPn oo and T n /p n -> 0, we have 

Pn 

(22) -£C-7„ ^ 0. 

Pfl i=l 

Furthermore ifT n — > T /or some Zirrai T > and z/E(|£"| 4 )/p n — > 0, we have 

(23) f-ECT-Tnl ^ AA(o,r). 

V Pn i=i / 

In the next three lemmas we suppose that a n — > <r > 0, and we write P n = P n ,o- n • 
Lemma 2. Xei s ra — > o" and let q n be a bounded sequence. 

a) The sequence ({w n f\y/p^) U ntSn ^ n (a n ) | P n ) is tight. 

b) If ' w n / ' y/jkl — ► oo i/ien 

v^^^K) £ ^5 ) AA(0,F( 2 )(o-)-F CT (o-) 2 ). 

Proof. We have U n:Snjqn (a n ) = ^ Yli=i Cl\ where for each n the £f s are i.i.d. with 
mean and variance given by 

In = Fn,s„,q„(o~n) ~ -f^n,s„ (°"n) i T n = -F^ J nj q n (o~n) ~~ Fn,s n ,q n i°~n) i 
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and further < a n for numbers a n satisfying a^/pn — > by (Bl). Now (B6) 
yields that j n — > 0, hence (B3) yields Fn,s n ,q n (°") ~~ *■ F a {o~). On the other hand, 
(B5) implies FnJ n ^ qn (a n ) — > a). 

Therefore it follows from (j23j) that 

(24) VK (U n!Sn>qn (a n )- 7a ) £ ^5 ) M(0,F^(a)-F a (a) 2 ), 

and since sup n u> n |7n| < °o by (B6), we readily get the two results. □ 

Lemma 3. a) The sequence ((w n A yS) U n ^s n ,Q n {o~n) I Pn.) is tight. 

b) If w n /^p^ -> oo, the sequence (y/jfa U n ,s n ,Q n ((Tn) I p n) converges in law 
towards the centered normal distribution with variance F^ 2 \a) — F a {a) 2 . 

Proof, a) Let V(n,s,q) = (w n /\ y/p^)U n ^ Stg (a n ). The previous lemma implies that 
as soon as the deterministic sequence s n converges to a, we have for all B > 0: 
(25) 

lim sup UA,B(n, s n ) = 0, where UA,B(n, s) = sup P n (|V(n, s, g)| > ^4). 

If the sequence (V(n, S n ,Q n ) | P n ) is not tight, there exists an infinite sequence 
nk such that P nk (\V(nk, S nk ,Q nk )\ > A) > 1/A for some A > and, up to taking 
a further subsequence still denoted by n k we can assume by (Al) that S nk — > a 
pointwise. Since (S n ,Q n ) is independent of the family (V(n, s,q); s > 0,q £ R), we 
get 

Pn k (\V(n k ,S nk ,Q nk )\ > A) <P„ fc (|Q n J >B) + B nk (u AB (n k ,S nk ))). 
Then l|25[) and Lebesgue's Theorem imply that 

limsup P nk (\V{n k ,S nk ,Q nk )\ > A) < sup P n (|Q n | > B) 

k n 

for all B > and, in view of (A2), we deduce that limsup fc P nk (\V(rik, S nk , Qn k )\ > 
A) = 0: this contradicts the definition of the sequence njt, and we have the result. 
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b) Let us denote by V a variable with law v = JV(0, F^(a) — F a (a)). Let v n ,s,q 
be the law of V(n,s,q) := y/p^ U n ^ s ^q{a n ). The claim amounts to proving that, for 
all bounded continuous functions g, we have 

(26) E n (g(V(n,S n ,Q n ))) - E( 5 (F)). 

For this, it is enough to prove that from any subsequence one can extract a further 
subsequence along which ()26[) holds. So, in view of (Al) and (A2) it is no restriction 
to assume that in fact (S n ,Q n ) converges in law to (a, Q) for some variable Q. 

In fact, due to the independence of (S n ,Q n ) and (W'(n,s,q) : s > 0, q £ R), we 
can replace the pair (S n , Q n ) in the left side of (|26j) by any other pair (S' n , Q' n ) having 
the same law than (S n ,Q n ) and still independent of (W'(n, s,q) : s > 0, q G R). 
Therefore, using the Skorokhod representation theorem, we can indeed assume that 
(S n ,Qn) converges pointwise to (a, Q). Then 

E n (g(V(n,S n ,Q n ))) =E n (^j v n ,S n ,Q n {dx)g{x))\ . 

Since S n — ► a and Q n — ► Q, one deduces from Lemma EJ-(b) that the sequence 
/ v n ,s n ,Q n {dx)g{x) converges pointwise to J v{dx)g{x) = E(g(V)), and it is bounded 
by II^H, so Lebesgue's Theorem yields (|2fi|). □ 

Lemma 4. The sequence c n converges in P n — probability to o~. 

Proof. Exactly as in the previous proof, without loss of generality we can assume that 
the pair (iS^, Qn) converges pointwise to (cr, with Q a suitable random variable. 

Lemma implies that U nj s n! Q n (cr n ) — ► in probability (recall that both w n and 
p n go to infinity). Observe that 

U n ,S ni Q n (u) - U n> s n ,Q n {o- n ) = H n>Sn {a n ) - H n<Sn {u), 

which by (B3) converges (pointwise) locally uniformly in u towards H(u) := F a (o~) — 
F a (u). Hence U n! s n ,Q n (u) also converges locally uniformly in u towards H(u), in P n - 
probability. But by (B4) the function H is null at a and is either strictly decreasing 
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or strictly increasing in a neighborhood of a: then the definition (|2U|) of a n (S n , Q n ) 
immediately gives the result. □ 

Finally, we prove Theorem [21 

Proof of Theorem^ As usual, to get the local uniformity in a for the tightness in 
(a) or the convergence in (b), it is enough to obtain the tightness (resp. convergence) 
under P n = P n ,u n for any sequence a n — > a > 0. Let us write for simplicity a n = 
Vn(S n ,Qn) and U n = U n . Sn ,Q n . 

By (B2), U n is continuously differentiable. We deduce from Lemma0]the existence 
of sets A n with P n (A n ) — > 1, such that on A n we have U' n {d ri ) = 0, and thus Taylor's 
formula yields a random variable T n taking its values between a n and a n , and such 
that 

(27) U n (a n ) = -(cr n - o- n )U' n (T n ) on the set A n . 

Observe that U' n {T n ) = —H' n Sn (T n ),. Since both S n and T n converge in probability 
to a, (B3) implies that U' n (T n ) -> -F' a (a) in probability Since F' a (a) ± by (B4), 
all the results of our theorem are now easily deduced from ((2Tj) and Lemma |31 □ 

With this general result in hand, we now turn to our specific situation: estimating 
a in the presence of the Levy process Y, first when the law of Y is known and second 
when it is not. 

4. Estimation of a in the parametric case. In this section, we study the 
estimation of a when the law of Y, i.e., the measure G G Gp, is known. We will 
construct a class of estimating equations for a, with given by 
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4.1. Construction of the estimators. In the sequel the number (3 G (0,2] is fixed 
and does not usually appear explicitly in our notation. A constant which depends 
only on (5 and on another parameter 7 is denoted by C 7 , and it may change from 
line to line. If G G G a with a < (3, and with the associated process Y, we set 
(28) 

b ~ I{\x\<l} xF ( dx ) if a < 1 ua , 

Z A (a) := A ^ (Y A - b'(G, a) A) 

b if a > 1, 

and we let G" A denote the law of Z^(a). Then we define the "modified increments" 
(recall ©): 



b'(G,a) = < 



(29) 



X?(G)=A-^( x ?-b'(G,f3)A n ). 



Next, for any a G (0, 2] and any </> G $ we set for x G (0, 1): 



( m 

l-Q 



(30) (f> a (x) = < 



0( x ) + /W + fl a e-VMiM") 
Vlog(iA) V / 



if a < 1 
if a = 1 



if a > 1. 

This defines an increasing function c/> a : (0, 1] — > i?+ having 4> < 4> a and <p a {x) 
as x — > 0. 



Next, if G G G a for some a < f3, and tt > and w > and z £ R and if /c is a 
bounded function, we set 



(31) 



^G,A,a,k(u,v, z) = / h/3(x)dx / G' Aa {dw) k{ux + vw + z). 



Finally, we introduce the "tail function" 



(32) 



ij(u) = P(|Wi| > = 2 / hp{x)dx 

n/u 



for u > (this depends on j3): it is C°°, strictly increasing from to 1, with non- 
vanishing first derivative. So its reciprocal function if) , from (0,1) into (0, 00), is 
also C°° and strictly increasing. 



14 



YACINE AIT-SAHALIA AND JEAN JACOD 



Recall that we work here under the assumption that G E Gg is known, and so in 
particular we know b'(G, /?); we also have G E G(<f>, j3) for some <j) E We need first 
a preliminary estimator, which is constructed as follows. We choose an arbitrary 
sequence m n of integers satisfying 



(33) 



m n J oo, 







and, recalling and ([52]). we set 

(34) 



Vn(G) = — £l 



{lx' n (G)l>l}' 



Sn(G) 



i=l 



^(^(G)) if < K„(G) < 1 
1 otherwise. 



To form an estimating equation for the construction of the final estimator of a, 
we choose a function k satisfying 
\k(x)\ 



(35) 



sup 

x 1+ XT 



< OO, 



I(k) := /" hp{x)k{x)d. 



x^O, 



where the number 7 satisfies 

(36) 7 > 0, < 2 



7< 2- 



Then we set 



(37) 



k n {x) 



k(x) 



if k is bounded 



k K x ) 1 {\k(x)\<u n } otherwise, 
where u n be an increasing sequence of numbers satisfying 

(38) v n - 00, vl M*X P ) - 0, ^ - 



0. 



and where ^ is associated with (a function such that G E G((f), (5)) by ()30|) . Then, 
with the notation p n = n — m n , and since each k n is bounded, we can define the 
following estimation functions (for u > 0): 

(39) U n>Gi(f>>k (u) = — 2^ fc n ' 



i=m n +l 



Sn(G) 



G,An,/3,kf, 



S n (GYS n (G) 



,0 . 
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Finally the estimators for a are: 
(40) 

the u > with U n) G,<i>,k(u) = which is closest to S n (G) if it exists 

a n (G,(f>,k) = < 

1 otherwise. 

As the notation suggests, this estimator depend on G and on k in an obvious way, 
and it depends on <p through the choice for k n made in (|H8|). It also depends on (3, 
but we leave this dependency implicit to avoid cluttering the notation. 



4.2. Asymptotic distribution in the parametric case. With the function k as in 
(|35|). the following defines two finite numbers: 

(41) J(k) = V(k{Wi?) - (E(M^i))) 2 , S 2 (fc) = jjj^. 

Theorem 3. Let (f> £ <£, and let k be a function satisfying $3S\) for some 7 having 
i3b}) . Suppose also that A n — > 0. 

a) The sequence y/n (a n (G,4>,k) — a) converges in law to N(0,a 2 T, 2 (k)), under 
P a ,G, uniformly in G £ G(cj), (3) and in a £ [e, 1/e] for any e > 0. 

b) We have £ 2 (fc) > 1/I(f3), and this inequality is an equality if we choose k = hp. 

Now we give a number of comments and examples. 

Remark 1. In light of j41\ ), it is of course possible / advisable to select the function k 
to minimize S 2 (/c). The choice k = hp is indeed possible: by the function k = hp 
satisfies \35\) with 7 = (resp. 7 = 2) if (3 < 2 (resp. (3 = 2). Such a choice gives 
asymptotically efficient estimators, in the strong sense that they behave asymptotically 
like the efficient estimators for the model Xt = aWt (with no perturbing term Y ). 

Remark 2. To put these estimators in use we would need to numerically compute the 
function ^q^^^{u,v,Q), for a single value of v (either 1 orl/S n (G)), and all values 
of u (in principle). Except in special situations (see for instance Section^), there is 
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no closed form for this function, and we have to resort to numerical integration or 
to Monte-Carlo techniques. For this it is of course helpful to have a closed form for 
k (or rather for the truncated k n ). In general, this is not the case for the function 
k = hp (the optimal choice), unless (3 = 2. 

Remark 3. As an example of function k, we can take k{x) = \x\ r , for some r > 
when (3 = 2 and r G (0, (3/2) otherwise (when (3 = 2 and r = 2 this is the optimal 
choice since h2(x) = —x 2 ): the function ^G,A n ,f3,k n * s still not explicit, but it is easily 
approximated by Monte-Carlo techniques, at last when Yt can be simulated, or it may 
be available in closed form for some common distributions ofY. We will do that in 
some detail in Section^ In any event, the limiting variance is easy to compute from 

COP- 

Remark 4. Another possibility is to use the empirical characteristic function of the 
sampled increments, which leads to an closed form expression for ^G,A n ,/3,k n - This 
will be done in Section^ 

4.3. Some preliminaries. Here we gather some results from |2], and also about 
the functions of ()31|) . which will be used to obtain the previous theorem and for 
further results as well. First we recall Lemma 2 of 0: for any <f> £ and with the 
notation (|3()|) . we have for A < 1 and a < (3 and K > and some constant C = C a 
depending on a only, 
(42) 

2(g-g) 2+/3 

Geg(<j),a), \g(x)\ < if (1 A \x\) =^ E(\g(Z A (a)\) <CKAW+^)(l) a (AW+^). 

In fact the proof of this result also works when 4>{x) = Q for all x (with <f> a substituted 
with a constant), thus giving 

_ 2(0-c) 

(43) Ge6(C,a), \g(x)\<K(lA\x\) E(| 5 (Z A (a)|) < CKQA^) . 

This is not enough for our purposes, at least in the semiparametric situation, and 
we will need also the next lemma about symmetrical measures: 
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Lemma 5. If A < 1 and a < (3 and K > 0, we have for some constant C depending 
on a only: 

(44) Ge^Ca), \g(x)\ <K(lA\x\ 2 ) => B(\g(Z A (a)\) < CK(A^. 



Proof. It is similar to the proof of Lemma 2 of [2J. Taking n > 0, we set Y" = 
^2 s<t AY s 1{\ay s \>ii} and Y' = Y — Y" and if 67 G <7 (C>°0 then F is symmetrical 
and thus we have (47) of the afore-mentioned proof (with <j) a substituted with a 
constant proportional to (), that is 

E(|^a| 2 ) < C(Ar] 2 - a 

for a constant C depending on a only. We also have Z A (a) = A —1 ^Y A , hence 
\g(Zs(a))\ < KA~ 2 ^\Y A \ 2 on the set {Y£ = 0}, whose probability is smaller than 
C(A/n a . Since \g\ < K, we deduce 

B(\g(Z A (a))\) < CK( (AtT + A 1 - 2 /^ 2 "") • 
Then take r\ = A 1 /' 3 to obtain the result. □ 



Next, as soon as the function k satisfies the first half of l)35jl with some 7 > 
which has 7 < (3 whenever j3 < 2, we set for u > and z £ R: 
(45) 

^k(u,z) = / fyg(xWux+z) dx = — I ha (—) k(x+z) dx = — \ hn\ ] k(x) dx. 

J u J W u J y \ u J 

(so ^k(u,z) = ^?GA,a,k(u,0, z), which depends neither on G, nor on A, nor on a). 

Lemma 6. a) Let k satisfy the first half of h'J5)) with some 7 > which has 7 < /3 
whenever (3 < 2. TTien is C°° on (0, 00) x i?. If further 7 > and ^ 6 (0, 00) and 
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k u (x) = k^ln^M^x, then for all K > there exists Mx^k such that 

(46) 



z\<K, v> M KM 

Qj+l 



+i 



dui dz l 



< 



if 13 < 2 
if = 2. 



(47) 



b) If k is bounded, then for all rj £ (0, 1) we have 

Qj+l 



rj < u < l/rj 



duj dz l 



< Q 



1,3, V 



Proof, (a) If I € iV, the jth derivative of u i— ► (—l) l h^\x/u)/u lJrl takes the form 
hij(x/u)/ui +l+1 for a function satisfying 

Cj,i/(1 + \x\ 1+l+ P) if < 2 



(48) 



Cjtil + \x\ 2 i +21 ) e~ x2 / 2 if = 2. 



In particular the estimate for < 2 above also holds for = 2, and further /i/j is 
differentiable and, for all G (0,2], 



(49) 



K,(*)l < 1 + mW 



Therefore we easily deduce from (j45j) that is C°°, with (by differentiating I times 
the last term in (|45)L then j times the analogue of the third term with hi instead 
of hp): 



(50) 
dvJ dz l 



> z ) = yj+i+i J hi,j{x/u) k(x + z) dx = -J^-j J hij(x) k(ux + z) dx. 



In particular, for some Ek > depending on the function k, we have 



+1 



dvJ dz l 



$? k (u,z) 



Qj+l 

dvJ dz l 



< u jli+i J \k(x + z) - k u (x + z)\hi tj (x/u) dx 
C k 



< 



uj+l+1 J{i+\*+*\ 



(1 + \x + z| 7 ) hij(x/u) dx. 
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Then a simple computation, using (|48[). gives us (|46j). 

(b) When A; is bounded, (jiSJ) and (JSOJ) immediately yield (|4"7|). 
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□ 



Finally we give estimates for the difference Aak an d ^k- 

Lemma 7. If kis a bounded function, *&G,A,a,k( u , v > z ) * s C°° * n (^j- 2 )? and for any 
n £ (0, 1) we have 



(51) f] <u< 



gj+i 



^GA,a,k{u,V,z) 



I A: I 



\ + \z\ l +P' 



dvJ dz l 

Moreover, for all n £ (0, 1) we have the following, for all A < 1 and z £ R and 
u G [t/, I/77] and v S (0,1/77] ; 



(%) IfGe G(4>, a) (resp. G £ £/(£, a) j, i/ien luit/i </> Q given 6?/ (Vesp. a = (J: 



(52) 



g- ^G,A,a,k(u,v,z) ~ q^J *fc(w,0) 



2(<3-a) 2+ff 

< C ji7? ||fc|| |z| + A' 3 < 2 +«)(/) a (A' 3 ( 2 +-)) 



(53) 



(ii) IfGeG (C,a), f/ien 



(7(/? *GAa,*( M ' ,, ' Z ) ~ *fc( u >°) 



z| + CA 18 



Proof. Observe that ^GA,a,k{u, v, z) 
*G,A,a,fc is C°° in (u, z), with 

Q3+1 



J'G' Aa (dw) ^ k (u,vw + z). Then by (|47|) . 



(54) 



^GA,a,k{u,v,z) = I G' A Jdw) 



and for any n £ (0, 1) we have (|51|l. 



* fc (n,wu; + 2), 



Next we prove (i). (|4"§)) yields 

(55) \y\ < 1 =^ |^o,i(^ + y) - V?(aO| < Cj,m ^ + ^p+zr 
Recalling (|5U() and (|54j) . we have 

(56) ^— fG,A,a,fc(ti,«,2) - —j ^ fc (u,z) = / G Aa (dw) g(w), 
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9 ^ = IhJ VW + Z) ~ IhJ ^ fc(n ' ^ 

1 

~~ vJ 
1 

~~ vJ 



hoj(x) (k(ux + vw + z) — k(ux + z)) dx 

(vw \ \ 
x J — hoj(x)J k(ux + z) dx , 



for u,v,z,j fixed. Let r\ £ (0,1), and suppose that -q < u < 1/tj and that v < 1/r/. 
If \w\ < 1 (|55j) obviously yields |<7(u>)| < Cj tT] \\k\\ \w\, whereas (|47j) yields |g(w)| < 



C 



.)■>! 



always: so we have |<?(u>)| < C. 



.)■>] 



w\ (\ 1), and in view of (fo^|) we readily 



deduce from if G G G(<p,a) and <@3J) if G 6 5(C,a) (then <£ Q = C), then 

2(i3-a) 2+/3 



(57) 



<9 J <9-? 



< C 



M u k\\ A^P+«)^ a (A?M), 



Moreover gTJ) yields * fe (u,z) - £j V k {u,0) 
these together gives (|52|) . 



< a 



|z|, so putting all 



Finally we prove (ii). The function hoj is C°° and all its derivatives satisfy the 
estimates (|48|). and in particular ff(x) = sup J/g [ :r _ 1 /^2 ^+1/^2] |^o'j(2/)l ^ s integrable, 



as well as h' j. Now we have 



(58) 



\w\ < 1 



hod [ x ~ ~~ ) ~ hoj( x ) ~ h'oj{x)—- < Cj^w 2 H(x 



cu- 



ds soon as v < l/rj and tj < u < 1/rj. Therefore we can write g = g\ + Q2-, where 

vw f 
9i(w) = —ppf 1{H<1} / h' 0j (x)k(ux + z)dx, 

^o,j (z - —) - h ,j(x) - h Qd (x)—J k(ux+z)da 



u / " u 

On the one hand, if G £ {7(Ci a ) then is symmetrical about 0, hence 

J gi(w)G' A a (dw) = because gi is bounded and odd. On the other hand, 

((SHI) pl us the integrability of H and the fact that |g(u>)| < Cj^ll^ll yield 

|<72(^)| < Ci,r?||fc||(^ 2 A !)• Hence, using Lemma El we get instead of (|57j) that 



(59) 



dv? 



(u,v,z) 



dvJ> 



< C jv \\k\\ CA" 



g-q 
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and we conclude as previously. 
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□ 



4.4. Proof of Theorem^ We start by proving (b). With the notation H = hp/hp, 
we observe that in addition to (|41j) . we have 

I(k) = E(k(W 1 )H(W 1 )), 1(0) = E(tf (H^) 2 ). 

An integration by parts yields E(H(W 1 )) = 0, so J(k) = E(k'(Wi) 2 ) and I(k) = 
E(k'(Wi)H(Wi)) if k'(x) = k(x) - E(k{W x )). The desired inequality, which is 
I(k) 2 < J(k)I(f3), follows from the Cauchy-Schwarz inequality. If k = hp we also 
have k = 1 + H, so this inequality is obviously an equality. 

For (a), and since p n ~ n, we apply Theorem |2Hb) with xf given by © and thus 
Pn,a = Pa,G- The first step consists in proving (Al) for S n = S n (G). This amounts 
to the following lemma, where a n — ► a > and P n = P an ,G : 

Lemma 8. The sequence S n converges to a in 'probability. 

Proof. By (|42|) the variables Z 1 ^ ((3) associated with the law G n converge in law to 
(because 4>p ( — > 0). The variables xT ■> which equal a n Wi + Z\ (/9) in law, 
converge in law to &W\. Hence 7 n := P n (|x'i n | > 1) ~ ► V^* 7 )- If CP = l{|xf|>i}> <E2> 
applied with g n = m n yields V n -— > VK ")- Since V* -1 is C°° and strictly monotone, 
the result readily follows. □ 

Next we set Q n = 0, so (A2) is satisfied, and 

fn,s,q[x) = k n I I , H n>s (u) = 9 G ,AnAkn [ ~i ~> J • 

Upon comparing (|39|) and (jjQj) with (fl9|) and (|2fl|) . we see that a n (G,(f),k) = 
&n(S n ,Q n ). Therefore it remains to prove (B1)-(B6) with a sequence w n satisfy- 
ing w n /^/p~^ — » do, and that 

(60) E 2 a) =a 2 J(k)/I{k) 2 . 
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Observe that under P a> G the variables xf have the same law as aW\ + Zj± n {fi). 
Then (|21j) gives F ns g{a) = H ns (a). It follows that (B6) holds with w n arbitrarily 
large, while (B2) follows from ()54|) , 

If k is bounded, hence k n = k, we have ||/ n ,s|| < ||^|| and (Bl) is obvious; further, 
with a = j3 and k r yields 

J =0,1, r = 1,2, /?<«< i u< i 
£r *G,A„,/3,fc-(« > «,0) - ^ * fc r(«,0)| < C %k <j> p {£^l p ) t 

which gives (B3) with F s (u) = * fc (w/s,0) and (B5) with F^ 2 \u) = %j(1,0). On 
the other hand when k is unbounded we have ||/ n ,s|| < u n and thus (Bl) follows 
from (jHEJ; further, v n — * oo and we can combine (|52j) with (|46j) to get for all n large 
enough: 

j = 0,1, r = l,2, r/<n<i, «<i => 

if /3 < 2 
if /3 = 2. 



£7 *G,An,/9,fc!l("i". ) " £j 



< 



C v , k lis r n 4> l3 (A 1 n / P) + 



1 

,3/7-7-1 



^ fc (^ 2 (Ay 2 )+e-^ /r7 



Then, in view of (|38p and 27 < /3 when /? < 2, we again deduce (B3) with F s (u) = 
¥ fc (u/s,0) and (B5) with F( 2 )(u) = %2(1,0). 

Since /io,i = — fys? we deduce that F CT (a) = 1^(1, 0)/<r = -I(k)/a (recall JHUJ) 
and the second part of (|35j)). hence (B4) holds. We also have F a (a) = ^(1,0) = 
E{k{W x )) andF( 2 )(a) = ^(fc(M^i) 2 ), hence J(k) = F^(a)-F a (a) 2 and © follows. 



5. Estimation of er in the semiparametric case. Perhaps more realistic than 
the situation of Theorem |3] is the case where we want to estimate a, but the measure 
G is unknown, although we know that it belongs to the class Gq. This is a semi- 
parametric situation: parametric as far as aWt is concerned, but nonparametric as 
far as If is concerned. Because G is unknown, the estimating equations in this case 
must be based on the law of W alone. The challenge is then to achieve rate efficiency 
despite the lack of information about G. 
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5.1. Construction of the estimators. As said before, we cannot hope for estima- 
tors a n that behave nicely for all G G Ga at once. Therefore we suppose that G is 
unknown, but is known to belong to Q(Q,a) for some a < f3 and some C > 0: we 
refer to this as Case 1. We also consider a more restrictive situation, called Case 2, 
for which G is known to belong to the set Q (£, a). 

The construction looks pretty much like the previous one, except that besides 
our preliminary estimator for a we need to produce an estimator B n for the drift 
b'(G,a) in order to remove it. In Case 2, since we know that b'(G,a) = we just 
set 

(61) B n = 0. 

In Case 1 we set m n = [Sn] for some arbitrary 5 £ (0, 1/2) ([x] denotes the integer 
part of x), so that m n ~ Sn. Then we pick a C°° and strictly increasing and odd 
function 9, with bounded derivative and 9(0) = and 9(±oo) = ±1 (for example 
9(x) = ^ arctan(x) ), and set for u G R 

(62) R n {u) = — Y,0{K X/P (x 1 }-u)). 

Since u \— » R n (u) is continuous and decreases strictly from +1 to —1 as u goes from 
— oo to +oo, we can set 

(63) B n = inf (it : R n {u) = 0) (= the only root of R n {.) = ). 

Next we construct our preliminary estimator for a. In Case 1, and with m n as 
above, we set q n = m n and p n = n — 2m n . In Case 2, we choose a sequence m n 
satisfying and then we set q n = and p n = n — m n . Then in both cases we set 

q n +m n 

— Yl 1 {|A- 1 /' 3 ( X ?-B„)[>1} 

^~ l {V n ) if < V n < 1 

< 

1 otherwise. 



(64) V n = 
and 

(65) S n = 
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To form estimating equations for a, we choose a function k satisfying 1)35(1 with 
7 = (that is, k is bounded and I(k) ^ 0). With ^ given by (|45j) we define the 
estimating functions (for u > 0) 

<» ^"S. t + /t^f^)-(i.»). 

l=<?„+m n + l \ / 

and the final estimators 



the it with U n (u) = which is closest to S n if it exists 



1 otherwise. 



(67) a n {k) 



Note that, unlike the centering ^G,An,P,k„ y g jq} , 5 1 |g) > J utilized in the para- 
metric case (recall (|39|) ). the centering we now use, based on ^ ^-j-, 0^ in (|66|) does 
not involve the measure G. Indeed, these estimators depend explicitly on 8 and k, 
but on nothing else, and in particular not on G. Observe that they are much easier 
to compute than the estimator of the parametric case. This is particularly true when 
k(x) = cos(u)x) for some w > 0, since then ^j,(u,0) = e - wl3ul3 / 2 [ s fnvertible in u, 
and we will detail this example in the next section, but it is also true in general: first 
because they depend only on the function ^(u, .) which is much simpler than the 
function ^GA,P,k accruing in the estimation in the parametric case, second because 
as a rule u \—* ^ f / c (tt,0) is at least "locally invertible" around u = 1. 

The estimators (|66[) have formally the same expression in both Case 1 and Case 
2, but the preliminary estimators B n and S n disagree for the two cases and also 
p n ~ (1 — 25)n in Case 1 and p n ~ n in Case 2, a difference which is important for 
the asymptotic variance of the estimators. So we will write "the Case 1 version" or 
"the Case 2 version" of the estimator. 



5.2. Asymptotic distribution in the semiparametric case. Recall the notation I{k) 
and J(k) and S 2 (fe) of (|3"5|) and (|41() . and let us add some other: 

/ \ , „x 2(8 — a) ,, „v 8 — at 
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Observe that p(a,(3) < p'(a,f3) always. 

Theorem 4. Let a £ (0, (3) and £ > 0, and k be a bounded function with I{k) ^ ; 
and e £ (0, 1). Take the Case 1 version of the estimators. 

a) If 

(69) sup nA 2 n p( - a ^ -» 0, 

n 

the sequence ^fn (a n (k) — a) converges in law to iV(0, a 2 T, 2 (k) /(l — 25)) under P a ,G> 
uniformly in n > 1 and in a £ [e, 1/e] and in G £ G(C, ot). 

b) In general, the variables {yjn f\ A n p ^ a '^)(a n (k) — a) are tight under P a ,G> uni- 
formly in a £ [e, 1/e] and in G £ Q(Q,a) and n. 

Theorem 5. Let a £ (0, (3) and £ > 0, and k be a bounded function with I{k) ^ ; 
and e £ (0, 1). Take the Case 2 version of the estimators. 

a) If 

(70) sup nA 2 /^ -► 0, 

n 

the sequence ^fri (a n (k) — a) converges in law to N(0,a 2 Yl 2 (k)) under P a ,G> uniformly 
in n > 1 and in a £ [e, 1/e] and in G £ Q (£, a). 

b) In general, the variables {-\/n f\ A n p ^ a '^)(B n (k) — o~) are tight under P a ,G> 
uniformly in a £ [e, 1/e] and in G £ G'{(, a) and n. 

The optimal choice of the function k has been discussed after Theorem |3J when 
(3 < 2, we have asymptotic efficiency in the situation of the second theorem above, 
provided we take k = hp, and despite the fact that we are in a semiparametric 
setting. When (3 = 2 the choice k = hp, that is k(x) = —x 2 , is not permitted in the 
above theorem, but with k(x) = — ^ 2 1{| X |<A} one achieves an asymptotic variance 
which approaches the optimal variance when A goes to infinity: see Sectional 
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Also, some other comments are in order here: 

Remark 5. When a increases, then p(a,(3) and p'(a,(3) decrease, so h6y\) and \7C\ ) 
are more difficult to obtain and the "rate" in (b) of the two theorems above gets worse, 
as it should be. 

Remark 6. In connection with what precedes, one should mention that when \6y\) 
fails the actual rate of convergence (that is, a sequence 5 n such that the law of 
S n ((a n (k) — a) converges to a non-degenerate limit, or at least admits among its 
weak limiting measures a non-degenerate one) is not only unknown, but actually de- 
pends on the true underlying (unknown) measure G and in particular on the minimal 
index a' such that G £ Q a i (we know that a' < a, but the inequality could be strict). 
In other words, the rate could be for example y/n for a particular G, even without 

fig). 

Remark 7. However we will see in the examples below (see Section^ in particular) 
that \7U\j) is necessary for having convergence to a centered distribution with rate y/n 
and also that the rate in (b) of Theorem\J^is sharp, if we want to have a result which 
holds uniformly in G £ Q ((,a). We do not know whether \6$) or the rate in (b) are 
optimal for Theorem^ 

Remark 8. Of course it might exist other - thoroughly different - estimators behaving 
better than the o n {k) 's, and perhaps having a better rate than in (b) of these theorems 
(the rate cannot be improved in (a), of course). We think this doubtful, however. 

Remark 9. The most interesting situation is when we have asymptotic efficiency 
(this happens when G is symmetrical), or at least "rate-efficiency" (that is of order 
\fn). We have this under i69\) or j7(J\ ), which mean that A n goes to fast enough. 
Of course having A n = o(l/n) is of no practical use. When A n = 1/n, then rate- 
efficiency is satisfied as soon as a < 2/3/(4 + (3) for the first theorem and a < (3/2 for 
the second one. If Y is a compound Poisson process with drift, rate efficiency holds 
as soon as nA^ is bounded, whatever (3 G (0, 2] is (take a = 0). 
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Remark 10. When we do not know that G is symmetrical we cannot achieve asymp- 
totic efficiency even under ib'ty) . However the asymptotic variances in the two theo- 
rems above are the same, up to the factor 1 — 25: hence by choosing 5 small one can 
approach asymptotic efficiency as much as one wants to. 

5.3. Proof of Theorems^ and\^ As above, we refer to Theorem |3] as to Case 1, 
and to Theorem [21 as to Case 2. The proof goes through several steps. 

1) We fix a S (0, j3) and Q > 0. The sequence A n is fixed, and we set 



(71) P = 



p(a, (3) in Case 1 ^ 
p'(a, f3) in Case 2, 



In order to get tightness or convergence, "uniform" in a and in G is the relevant 
class, it is of course enough to take a sequence a n — > a > and a sequence G n 
in Q((,a) (resp. </(£,«)), and to prove the tightness or convergence in law of the 
normalized estimation errors d n — a n , under the measures P n = P an ,G n - Below we 
fix the sequences o~ n and G n . 

Finally, we denote by Z n := Z^(a) the variable associated with the measure G n 
by (J2HJ), and we set b' n = An 1 ^b' (G n ,a), which vanishes in Case 2. 

2) Let Q n = X n B' n , where B' n = (A^^B-n — b' n ). We want to prove that the 
sequence Q n satisfies (A2). This is obvious in Case 2 because Q n = 0. So we suppose 
that we are in Case 1. Let us introduce some notation: with j = 1,2 and 9' being 
the derivative of 6, we put 

(72) Tj(a) = Effl^n T'^a) = E(0'(<tWi)) 



(r^ is of course the derivative of T\). 
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Observe that B' n is the only root of R n {.) = 0, where 

K n (u)=R n (AVP(u + b' n )) = —J2C?(u), with C(u) = e(A- 1 /P x ?-u-b' n ). 



The £™(u)'s for i > 1 are i.i.d. with the same law (under P n ) than the variable 
0(a n Wi + Z n — u) (we have used here the scaling property of W). 

The functions ^ n ,j{u) = .E n ((£"(«)"'), for j £ N, are C°° and bounded as well as 
their derivatives, uniformly in u and n, and we can interchange derivation and expec- 
tation. So we can apply (|4*3*)) to the functions g n ,j,p{w) = / hp{x)(d p 9^ j 'du p )(a n x + 
w — u) — {d p 9 3 / 'du p )(a n x — u)) dx, to get for p,j £ N: 



(73) 
QP 

ln,j{u)-T jiP (a n ,u) 



f d p 9 j 

< C pj C A n> where r j P ( v > u ) = (- 1 ) / -Q^-( vx - u ) h p(. x ) dx - 



In particular r^cx, 0) = ^ (u) for j = 1, 2 and i\i(<7 n , 0) = T^cr) with the notation 

CZ2D- 

Now, also is C°°, bounded as well as all its derivatives, uniformly in n, u and 
uj. So an application of Lemma and the continuity of the functions Tj p readily 
yield 



QP 

(74) — — 1Z n (u) — ► Fi P (a, u) locally uniformly in u, in P n -probability, 
ou p 



(75) r? n := JW n (^n(0)-7n,l(0)) ^ AA(0, r 2 (a) - T^a) 2 )). 

The properties of 9 imply that u i— > Ti^a, .) decreases strictly and vanishes at 0; 
since by construction R n (B' n ) = 0, we deduce from ((7i|) for p = that -B^ — ^ 0. 
Another application of (|74() yields that R' n {B'^) — ^ T^cr) for any sequence B" t of 
random variable going to in P n -probability. Since R n (B' n ) = we have 

(76) K(K) K = -TZ n (0) = ~1= - 7n,l(0) 
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for some random variable B" satisfying \B'^\ < \B' n \. Moreover ri o(0) = 0, due to the 
fact that 9 is odd, hence |7 n ,i(0)| < CQK by {29 . Since R' n (B'^) ^ T'^a) ^ 0, 
we deduce that Q n = \ n B' n satisfies (A2) from (|75)l (recall m n ~ 5n here and ((7T|) ). 



3) Now we proceed to proving the consistency of the preliminary estimators S n . 
In Case 2 the variables V n and S n are the variables V n (G n ) and S n (G n ) of (|2l?|) and 
(JHIJ (they do not depend on G n in fact), so the result follows from Lemma El In 
Case 1, set 

q n +m n 

i=<Jn + l 

Then (JUJ) yields 

(77) Vnfwn) - 8 n (v n ) ^ 0. 

However, A n 1 ^(^" — t> n ) has the same distribution as a n W\ + Z n + 6' n — A n 1 '^v n , 
which by (|4*2l converges in law to aW\ as soon as b' n — A n l ^v n — ► 0. Since -B n and 

(Vn(w) : v £ R) are independent and 5^ = /S. n 1 ^ B n — b' n -^A because Q n = A n -B^ 

p 

satisfies (A2) and A n — > oo, we deduce from (|77f) that F n = V^(.B n ) — ^ VK* 7 )- Then 

the consistency is proved like in the end of Lemma |HJ 

4) At this stage we will apply Theorem [3 with the variables (S n ,Q n ) as above 
and the i.i.d. variables (Xq n + mn +i : 1 — * — Pn)- Observe that with the notation Q2UJI 
and (|67jl. we have a' n {k) = &n(S n ,Qn)- We have shown (Al) and (A2) in the two 
previous steps. Set 

w*> = * ( ^ Wx - b 'n-^A f HnAu) = ^ f « v 



s 



Then (jUJ) gives for r = 1, 2: 



Fn,s,q(u) — ^G n ,An,a,k ( ~ , ~, 



s s sA, 



"J > F Slq( u ) = ^G«,An,a,fc2 (-, -, M 
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Let us check (B1)-(B6). Since k is bounded, (Bl) is obvious, whereas (B2) follows 
from Lemma El Next, if we set F s (u) = VP^u/s, 0) and F( 2 \u) = \P fe 2(l,0), Lemma 
[3 yields for j = 0,1 and r] £ (0, 1) and s,u G [77, 1/77] and |g| < I/77: 

^^ n »-^F s («)|<C M CA^ 

\F njSjq (u) - H n!S (u)\ < C Kv ((Ar n + y^j . 

These give (B3) and (B5), and also (B6) with w n = \ n . Finally (B4) holds because 
f' s (s) = V4(l> 0)/s = —I(k)/s, and (|6U|) holds here as well as in the previous section. 

We can thus apply Theorem the sequence A n (a n — a n ) is tight under P n in all 
cases, and this gives the two claims (b). Under (|69|) or l|70j) we have X n /Vn — ► 00, 
hence X nA /p^ — > 00 as well, so ^Jpn~ ia n — a n ) converges in law under P n to a centered 
Gaussian variable with variance 



I 2 (a) 



FW(a)-F a (a) 2 



which in view of F^^a) 2 = J(k)/cr 2 equals a 2 Y?(k): since p n ~ (1 — 25)n in Case 1 
and p n ~ n in Case 2, we obtain the two claims (a). 

6. Example: The empirical characteristic function. We now turn to spe- 
cific estimators. To each specification of an admissible function k (in the sense of 
satisfying the assumptions of the above results), corresponds an estimator for a. For 
instance, one way of estimating a parameter for i.i.d. variables Xj is to use the em- 
pirical characteristic function, that is exp(iwXj) for some given w (or several 
id's at once) and where J is the index set. If the -Xj's are symmetrical, one should 
in fact look at the real part only, that is X^eJ cos(wXj). Other estimators based 
on the empirical characteristic function in related contexts are given by e.g., [S], [3], 
@], Chapter 4 in 11 and pj 
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In the parametric situation, at stage n the variable Xj is x'l(G) an d J = {f^n + 
1, . . . , n}. Those variables are "almost" symmetrical (the leading term W coming in 
them is symmetrical). So we consider for any given w > the variable 

where S n (G) is the preliminary estimator. In other words, if we take k(x) = cos(wx) 
(a bounded function, so k n = k in (|37|)). the estimating function of (|39|) is 

(79) U n>QAk {u) = KM - *G,A B ,/S,* (g^ffj > ^7G) 5 °) • 

Furthermore, this class of functions is one for which the function ^G,A,/3,fe is 
explicit, at least when the exponent in the Levy-Khintchine formula for Y is explic- 
itly known. More precisely, let us write p(u) for the exponent in (JSJ), and recall that 
E(expiuYt) = exptp(u). Then obviously when g{x) = e lwx we have 

f G , A , A9 (ii, v, 0) = exp (^_]^_ + Ap(wvA~ 1/(3 ) - iwvb'(G, ajA 1 ' 1 ^ . 

Taking the real part, and using (|28|) and the fact that G £ Gp, we see that for 
k(x) = cos(wx) we have 

(80) *G,A,Afc(«, v, 0) = e A ^ co S (B A (u, «)), 
where 

(81) A A («,«) = + / *W (cosfwDA 1 - 1 /^) - 1 



(82) B A (u,v) 



jF(dx) siniwvA^^x), if /3 < 1 

k / F(cte) (^(tyuA 1 - 1 /^) - toi;A 1 - :L //»zl { | a> |< 1 j) if /? > 1. 
So we can inject these formulas directly into (|79|). 
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As for the asymptotic variance in Theorem |3J it is even simpler. Indeed, we have 
here 

(83) V k (u,O)=e- w0u0 / 2 . 

Therefore J(jfe) = -*£(1,0) = w 13 e~ wf 'I 2 /2 > and J(k) = l(* fc (2,0) + 1) - 
^ fc (l,0) 2 = \ (l + e-^y 3 ' 2 ^ - e~ w \ and thus 

(84) £ 2 (fc) = 2 Z 4 . 

When (3 < 2, it turns out that the minimal variance is achieved for some value 
w = wp £ (0, oo), whereas S 2 (/c) tends to oo when to goes either to or to oo. In 
contrast, when (3 = 2 the variance S 2 (A;) goes to 1/2 as w — ► 0: recall once more 
that 1/2 is the efficient variance in that case. 

For the semiparametric situation, things are even simpler. The estimating function 
of (j5ni> becomes 



(85) 



provided in (|78|) we sum over i S {q n + m n + 1, . . . , n}. Moreover u i— ► ^(u, 0) is 
invertible, so the estimator a n (k) takes the simple explicit form 

(86) cr„,(fc) = -log — 2^ cos ET^ 




W \ \ Pn \ S. 

1 i=q n +m„+l 

if the argument of the logarithm is positive (otherwise, put for example <x n (&) = 1) 



7. Example: Power and truncated power functions. Another natural 
choice for the function k is a power function, that is k(x) = \x\ r , for some r > 
when (3 = 2 and r € (0,(3/2) otherwise (when (3 = 2 this is - in principle - optimal 
for r = 2). In general, the function A n k„ 1S n °t explicit but can be numerically 
approximated via Monte-Carlo procedures for example. We can also compute the 
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limiting variance: with the notation m r = E(\Wi\ r ) we get I(k) = —rm r and 
J(k) = rri2r — ntf, hence 

<«> ^ - ^ 

When (3 = 2 we have a closed expression for m r (see Q95|l below), and not surprisingly 
S 2 (/j) achieves its minimum, equal to 1/2, at r = 2: recall that 1/2 is the "efficient" 
variance in that case. When j3 < 2 we have no explicit expression for these moments. 
However, £ 2 (/c) goes to oo when r increases to (5/2, and we conjecture that S 2 (fc) is 
monotone increasing in r (this property holds at least when (3 = 1); so one should 
take r as small as possible, although r = is of course excluded. 

In the semiparametric setting, the previous choice is not admissible, since k has to 
be bounded. So we must "truncate" the argument, by using the following function 



M x ) = M ri {w<7} 

for some constant 7. The function ^^(it, 0) = u r £ , (|W / i| r l{|w 1 |< 7 / u }) is invertible 
from a neighborhood / of u = 1 onto some interval and we write VE^ (v) for the 
inverse function at v G I'. Then if B n and S n are the preliminary estimators, and if 



1 " 

(89) Vn {i) = — J? £ |x? - Bn\ r l m & A i» } , 

Vn^n j=m n +l 

the estimator CT n (A; 7 ) is defined by 

(90) a n ( kl ) = S n (^P) 

if the argument of ty^ 1 above is in I 1 , and a n (k^) = 1 (for example) otherwise. This 
is almost as explicit as (|86|) is. Since k^ is even we again have J(fc 7 ) = 0, whereas 

(91) £ 2 (/o 7 ) = — — I— — -^-—5, where M 7 , s = E(|mri { | Wl |< 7} ). 

(rM 7)J . - 2/i/j (7)7 + ) 

We can then try to minimize this variance, by appropriately choosing the two con- 
stants 7 > and r > 0. 
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One could also use & 7n , the rth power truncated at some level 7 n > depending 
on n: our general results do not apply, but similar results, with possibly other rates, 
should obviously apply. In fact, in the next section we work out completely this 
kind of truncated power functions in a particular case, to check that it is best (for 
the rate of convergence at least) to take a constant level 7„ = 7, as it is implicitly 
proposed in the method previously developed. 

8. Example: Brownian motion plus Gaussian compound Poisson pro- 
cess. In this section, we present a fully worked out example, where W is Brownian 
motion and Y is a compound Poisson process with Gaussian jumps, say iV(0, 77), 
and intensity of jumps given by some A > 0. | T and j3 4 studied the estimation of 
the parameters of this model, using a variety of methods. 

As usual, we are interested in estimating the parameter a given the increments 
Xf of X t = aWt + Yt (see (JHJ). We consider a number of estimating equations for 
this model, based on the power or truncated power variations 



for r £ (0,2]. Here t(A) is the truncation rate, taken to be of the form t(A) = 



Note that V n above is exactly V n (j) of (|59*|) with 7 = A n r(A n ) (here Y is 
symmetrical, so B n = 0). The associated estimator is then given by 



where H A is the local inverse around 1 of the function H&(u) = E(\uW^\ r ln u w A \< T (A)})- 

When c = 00 we get the (non truncated) rth power variation. If c < 00 and k = 
this corresponds to taking k = k c , as given by (|88|1: we essentially eliminate from 
the sum above the increments in which Y jumps. When k > we eliminate more 
increments, and fewer when k < 0. 



(92) 





(93) 
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The expected values of the powers without truncation are given by 

+ 00 eyr/2 / 1 I \ 

(94) E(|x A |') = g^r(i±rj e-^iXAy^A + j^ 2 , 



or/2 /1 , \ 

(95) B(\aW A \ r ) = -j=T i^-j a r A r / 2 

With truncation at rate t(A), we get 
(96) 

E (|X A ri{|X A ]<r(A)}) 



e 



where T(a, •) denotes the incomplete Gamma function of order a, and 

(97) E(|^ ari „ (A)) ) = ^(r(l^)-r(l^,^))^. 

When r = 2, we have V (3/2) = yvr/2 and T (|,x) = e~ x ^/x + t/tt $(^2x) where $ 
denotes the cdf of the iV(0, 1) law. Similarly simpler expressions are also obtained 
in the case where r = 1, since T (1) = 1 and T (1, x) = e~ x . 

As described above, in the semiparametric case where the distribution of Y is 
not known to the statistician, we propose to use an approximate centering based on 
computing these expectations assuming that X = aW only (i.e., as if there were 
no jumps) and we will study the behavior of this estimator when Y is in fact a 
compound Poisson process. The effect of the misspecification error is to bias the 
resulting estimator of a. But, at the leading order in A, the expected values of the 
moments functions computed without jumps coincide with those computed under 
the correct specification. Indeed, for X from we have 

E(|XaD = E(|cTiyAr)+o(A r/2 ) 

E(|Aa|1 

{|x A |<r(A)}) - E (|<tWa|1{| (T w a |< t (a)}) +o(A 7 ' 2 ), 
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with the second result following from 

r(a)+x a (-i + T f^ + 0(x 2 )) near 

e -x x -l+a(i + 2=± + o(x- 2 )) near +00. 



(98) T(a,x) 



As a result, the bias of the estimator of a based on approximate centering will 
vanish asymptotically in A and we will have a result of the form 



VnA^ (a n -a n ) ^ N(0,v ) 

where 

<7 n = <7 + &oA£+o(A£) 

with b\ > 0. (If b\ = for some choice of (r, n, c) then the parameter a is not identi- 
fied by an estimating function based on that combination.) Also, v\ = corresponds 
to a rate of convergence of the estimator of n 1 / 2 , and any value v\ > corresponds 
to a slower than n 1 / 2 rate of convergence. 

We also note that when b\ > the rate of convergence and asymptotic variance 
of the semiparametric estimator of a are identical at the leading order in A n to the 
expressions one would obtain in the fully parametric, correctly specified, case where 
centering of the estimating equation is done with either (|94(l or (|96|) as appropri- 
ate, instead of the approximate centering using (|9*5|) or (|9*7)) . Centering using the 
latter is of course the only feasible estimator in the semiparametric case where the 
distribution of Y is unknown. 

In what follows, we use the explicitness of this model to fully characterize the 
asymptotic distribution of the semiparametric estimator of a, i.e., (60, 61, t>o, v\) as 
functions of (r, k, c) and the parameters of the model (a, \,r/). 



8.1. Power variations without truncation. In that situation, we have for the 
asymptotic variance: 
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/ rfi+rl \ 

• When < r < 1, we have v\ = and vq = 4j ( -v/TT ^T+^y ~~ 1 J ■ 

• When r = 1, we have u i = and vq = \ ((7r — 2) a 2 + 7rA?7) . 

• When 1< r < 2, we have Vl = r - 1 and v = ^^T^ 4tel- 

As for the bias, when < r < 2 we have b\ = 1 — r/2 and 6o = - — ^p- — . 

Remark 11. The estimator based on power variations converges (not taking the bias 
into consideration) at rate n 1 / 2 only when r < 1. When r > 1 £/te mixture of jumps 
and volatility slows down the rate of convergence (v\ > 0). Whenr = 2, the parameter 
a is simply not identified, as is obvious from the fact that E(X^) = (a 2 + Xrj)A. This 
is also apparent here from the fact that b\ { as r j 2, so the bias no longer vanishes 
asymptotically. And the bias even worsens the rate, of course. 

Remark 12. When r < 1, the asymptotic variance vo is identical to the expression 
obtained without jumps, as was the case when the log-likelihood score was used as an 
estimating equation. When r = 1, the rate of convergence remains n 1 / 2 , but vq is 
larger in the presence of jumps. 

8.2. Power variations with A 1 / 2 truncation. If we truncate the increments ac- 
cording to t(A) = cA 1 / 2 , then v\ = for all values of r G (0, 2] and 

*° 4+2r (f (r {\ + r) - r (i + r, £)) - (r (if) - r (if, ^)) 2 ) 



(V2c^ exp (-£) - 2^r^ (r (if) - r (if, £)))' 



As for the bias, we have &i = 1 and 

*a (r (if )-r 



3+r c 2 
2 ' 2^ 



(r(if)-r(if,^))-2(r(2f)-r 

Remark 13. Truncating at rate A 1 / 2 restores the convergence rate n 1 / 2 for all 
values of r, (again, regardless of the bias) and permits identification when r = 2. 
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When < r < 1 (where the rate n 1 / 2 was already achieved without truncation) , not 
truncating can lead to either a smaller or larger value of Vq than truncating at rate 
n 1 / 2 , depending upon the values of(a 2 ,c). 

Remark 14. The asymptotic variance vo is identical to its expression when no jumps 
are present, as it should be in view of our general results ( as said before, this type 
of truncation leads to the estimators studied in our general results). In all cases, the 
bias is smaller than when no truncation is applied. 

8.3. Power variations with slower than A 1 / 2 truncation. If we now keep too many 
increments by truncating according to r(A) = cA 1 / 2+K , with —1/2 < k < 0, then 
we have for r £ (0, 2] : 

• When —3/ (2 + 4r) < n < 0, we have v \ = and 

a 2 f ^ r(i + r) \ 

• When k = —3/(2 + 4r), we have = and 

2 l/2-r c l+2r^ FA<T 2-2r ^2 / r (1 + r ) \ 

r 2 (1 + 2r)r/ i/2 r (i±r) 2 r^y r(^) 2 y 

• When -1/2 < n < -3/(2 + 4r), we have ^ = -k - 2rn - 3/2 > and 

2 l/2-r c l+2r / ?ACT 2-2r 

r 2( 1 + 2r ) r/ i/2 r (i±i) 2 

As for the bias, we have: 

• When -1/(2 + 2r) < k < 0, we have &i = 1 and 6 = 

. When « = -1/(2 + 2r), we have &x = 1 and 6 = ( ^Tff) " 1 " ?) 

• When -1/2 < k < -1/(2 + 2r), we have b x = 3/2 + k + > and 6 = 

2 l/2-r/2 c l + r Ag .l-r 

r (i +r )^r(i±r) • 
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Remark 15. When < r < 1, we are automatically in the situation where k > 

1 /2 

— 3/(2 + 4?"), and hence keeping more than 0(A n ' ) increments results in the conver- 
gence rate n 1 / 2 and the same asymptotic variance vq as when keeping all increments 
(i.e., not truncating at all). When 1 < r < 2, however, it is possible to restore the 
convergence rate n 1 / 2 (compared to not truncating) by keeping more than 0(An /2 ) 
increments, but still "not too many" of them (— 3/(2 + 4r) < k < 0) beyond that; but 
even keeping a larger fraction of the increments (—1/2 < k < —3/(2 + Ar)) results 
in an improvement over keeping all increments since 3/2 — k — 2rn < r — 1 so that 
the rate of convergence ofa n , although slower than n 1 / 2 , is nonetheless faster than 

1/2 

Remark 16. The expressions for k < do not converge to those with 0(A n ) 
truncation as k j because of the essential singularity of the incomplete T function 
near infinity, given in i 981(1 : when t(A) = cA 1//2+K then T((l + r)/2, •) is evalu- 
ated at r(A) 2 /(2cr 2 A) = c 2 A 2k /(2<t 2 ) and for fixed k < 0, terms proportional to 
exp(— c 2 A 2k /(2ct 2 )) are negligible in the Taylor series in A of vq and &o- This is not 
the case when k = however. 

Remark 17. As for the bias, keeping "too many" but not all increments (—1/2 < k < 
— l/(2 + 2r) ) leads to a smaller bias than keeping all increments, since 3/2 + n + rn > 
l — r/2, but to a larger bias than keeping just the right amount since 3/2 + K + rn < 1. 

8.4. Power variations with faster than A 1 / 2 truncation. Finally, if we keep too 
few increments by truncating according to cA 1//2+K , with k > 0, then v± = k for all 
values of r G (0, 2] and 

y/2n(l + rfa 3 
V ° ~ 2c(l + 2r) 

As for the bias, we have b\ = 1 and bo = a\. 

Remark 18. Truncating at a rate faster than A 1 / 2 deteriorates the convergence rate 
of the estimator from ra 1 / 2 to n 1 / 2 A.^ 2 ; while we successfully eliminate the impact of 
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jumps on the estimator, we are at the same time reducing the effective sample size 
utilized to compute the estimator, which increases its asymptotic variance. 

Remark 19. The expressions for vq and bo for k > also do not converge to those 

1/2 

with 0(A n ) truncation as k j because once again we cannot interchange the order 
of the limits A n — > and k — > 0. 

8.5. Comparison with the general case. Let us compare, in the semiparametric 
case, the specific results just obtained with the general results obtained in Theorems 
0] and El In the present situation we have G G Gq- So these general results assert 
that if 

(99) nA 2 n -> 0, 

then the estimators a n converge at a rate ^/n, and the limit of the normalized 
error is Gaussian without bias; when ()99|) fails but A„ — > yet, then the sequence 
((V^AA- 1 )^-^) is tight. 

The estimators (|9~3|) converge at rate ^fn when v\ = and nA^ 1 is bounded 
(then there is a bias) or nA^ 1 — > (there is no bias). Otherwise, the sequence 
(VHATAA^ 1 )^ - a) is tight. Then: 

• Power variation without truncation: we have a rate i/n only when r G (0, 1] 
and nA^~ r is bounded. Otherwise the rate is always worse than in our general 
results: this was expected, of course. 

• Power variation with A 1 / 2 truncation: If nA 2 — > we have rate y/n with 
asymptotically unbiased error. If nA 2 — » a G (0, do) we have rate y/n with 
asymptotically biased error. If raA 2 — > oo, then A~ 1 (a n — a) converges in 
probability to the constant &o : this is a bit better than what we get by applying 
the general results recalled above. This holds irrespectively of r G (0, 2] (and 
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also for r > 2 here, as a matter of fact), but of course the asymptotic variance 
depends on r, and also on c. 

• Power variation with slower than A 1 / 2 truncation: The rate is yfn if —1/(2 + 
2r) < k < and nA 2 is bounded, or if -3(2 + 4r) < k < -1/(2 + 2r) and 
reA^ +2K+2rK is bounded. This is worse than the previous case. 

• Power variation with faster than A 1 / 2 truncation: The rate is at most \J nAj; , 
and always worst than in the A 1 / 2 truncation case. 



9. Example: Sum of two stable processes. In this last section we consider 
the case where Y is also a symmetric stable process, with index a G (0,(3). Then 



9.1. The empirical characteristic function. First, we can consider estimators 
based on the empirical characteristic function, that is we consider k(x) = cos(wx) 
for some w > 0. We have the parametric estimate a n = a n (G,(ft,k) of Theorem 
(here k is bounded, so <j) is indeed irrelevant). The sequence \/n (a n — a) converges 
in law to N(0, a 2 Y?(k)), where S 2 (/c) is given by (|84j) . On the other hand we have 
the semiparametric estimators a n (k), which by Theorem |S] behaves as such: under 

2(/3-a) 

(100) nA n 0, 

\fn (d n (k) — a) converges in law to N(0, cr 2 S 2 (fc)). And in general the sequence 

0—a. 

(^A A « " )(£„-<t) is tight. 

In fact, since we are in Case 2 the preliminary estimator S n = S n (G) is the 
same in both cases, and a n and a n (k) are the solution of U n (u) = and U' n (u) = 
respectively, which are closest to S n , and the difference between these two estimating 
functions is 

U n (u) - U' n (u) = U n (u) := ¥ G ,An,/J,* (jr, ]r,0) ~ *k (j^,o\ 



42 YACINE AIT-SAHALIA AND JEAN JACOD 

(recall ((79")) and (|85|)1. If we use the explicit forms (jHOj) and we get 



/3-q 

which is equivalent to ^£5- An* 3 e - w P/ 2 as n — > oo and u — > a (recall that S n — ► 
a in probability). Since *' fe (l,0) = -Pe~ wb / 2 + 0, we deduce that the difference 

g-q 

& n {k) — a n is equivalent (in probability) to — {w a /2f5a a )A n 13 . Therefore, in addition 
to the fact that y/n ia n {k) — a) converges in law to iV(0, a 2 T> 2 (k)) under (|1UU() . we 
get 



/3-a 

If nA n p — > a 2 6 (0, oo), then y 7 ^ (a n (k) — a) converges in law to 
iV(-au; a /2/3cj Q ,cj 2 S 2 (A:)), 



g-a g-a 

If nAn' 3 — ► oo, then A n 13 (a n (k) — a) converges in probability to the con- 



stant -w a /2(3a a . 



We conclude that the results of Theorem [51 are sharp, for the particular estimation 
functions k(x) = cos(wx) at least. 

9.2. Truncated power functions. We can do a similar analysis for the estimators 
(jnOjl . based on the truncated power variation V n {^) of (jEHjl with = (because 
Y is symmetrical here). That is, we consider the truncated power variations at the 

o 0-a 

level A n ■ Namely when nA n 13 — ► oo, one can show that, at least when 7 is small 

g— a 

enough (but it is probably true for all 7 > 0), then the sequence A n (a n — a) 
is tight and its limiting distributions include some Dirac masses at non vanishing 
constants. So here again the results of Theorem|2]are sharp. But of course, as already 
said before, this does not completely rule out the existence of estimators constructed 
in a different way and behaving better. 
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10. Conclusions. We exhibited a class of estimators for the volatility param- 
eter a in a model where the driving process Wt is perturbed by another process Y t . 
These estimators can be designed in such a way that they are immune to the pres- 
ence of the perturbation Y t : they are asymptotically efficient, in the strong sense 
that they behave asymptotically like the efficient estimators for the model X t = aWt 
with no perturbing term Yj. 



REFERENCES 

A'it-Sahalia, Y. (2004). Disentangling diffusion from jumps. Journal of Financial Economics 74 
487-528. 

Ait-Sahalia, Y. and Jacod, J. (2004). Fisher's information for discretely sampled Levy processes. 

Tech. rep., Princeton University and Universite de Paris VI. 
Fenech, A. P. (1976). Asymptotically efficient estimation of location for a symmetric stable law. 

Annals of Statistics 4 1088-1100. 
Feuerverger, A. and McDunnough, P. (1981b). On the efficiency of empirical characteristic 

function procedures. Journal of the Royal Statistical Society - Series B 43 20-27. 
Godambe, V.P. (1960). An optimum property of regular maximum likelihood estimation. Annals 

of Mathematical Statistics 31 1208-1211. 
Hansen, L.P. (1982). Large sample properties of generalized method of moments estimators. 

Econometrica 50 1029-1054. 
Heyde, C.C. (1997). Quasi-Likelihood and Its Applications. Springer- Verlag, New York. 
Mancini, C. (2001). Disentangling the jumps of the diffusion in a geometric jumping Brownian 

motion. Giornale dell'Instituto Italiano degli Attuari LXIV 19-47. 
Press, S. J. (1972). Estimation in univariate and multivariate stable distributions. Journal of 

the American Statistical Association 67 842-846. 
Singleton, K. (2001). Estimation of affine asset pricing models using the empirical characteristic 

function. Journal of Econometrics 102 111-141. 
Zolotarev, V. M. (1986). One- dimensional Stable Distributions, vol. 65 of Translations of 

Mathematical Monographs. American Mathematical Society. 



Department of Economics 
Princeton University and NBER 
Princeton, NJ 08544-1021 
E-MAIL: yacine@princeton.edu 



Laboratoire de Probabilities (UMR 7599) 
Universite P. et M. Curie (Paris-6) 
75252 Paris Cedex 05 
e-mail: jj@ccr.jussieu.fr 



